Disclosed are gene sequences encoding γ-tocopherol methyltransferases from photosynthetic organisms. The enzyme γ-tocopherol [...]
tocopherol profile of the plant. Transgenic plants can be made that have α-tocopherol as the predominant tocopherol in their seeds and oils.
TRANSGENIC PLANTS WITH TOCOPHEROL METHYLTRANSFERASE

CROSS-REFERENCE TO RELATED APPLICATIONS 
This application claims priority to U.S. Provisional 
Application Serial No. 60/053,819 filed July 25, 1997 and U.S. 
Provisional Application Serial No. 60/072,497 filed January 26, 
1998. 

STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH OR DEVELOPMENT 
Not applicable. 

BACKGROUND OF THE INVENTION 
Vitamin E is an essential component of mammalian diets. 
Epidemiological evidence indicates that Vitamin E 
supplementation results in decreased risk for cardiovascular 
disease and cancer, aids in immune function, and generally 
prevents or slows a number of degenerative disease processes in 
humans (Traber and Sies, Annu. Rev. Nutr. 16:321-347, 1996). 
Vitamin E functions in stabilizing the lipid bilayer of
biological membranes (Skrypin and Kagan, Biochim. Biophys. Acta
815:209, 1995; Kagan, N.Y. Acad. Sci. p 121, 1989; Gomez-Fernandez
et al., Ann. N.Y. Acad. Sci. p 109, 1989), reducing
polyunsaturated fatty acid (PUFA) free radicals generated by
lipid oxidation (Fukuzawa et al., Lipids 17:511-513, 1982),
and quenching singlet oxygen species (Fryer, Plant Cell
Environ. 15(4):381-392, 1992).

Vitamin E, or α-tocopherol, belongs to a class of lipid-soluble
antioxidants that includes α-, β-, γ-, and δ-tocopherols
and α-, β-, γ-, and δ-tocotrienols. Although α-, β-, γ-, and
δ-tocopherols and α-, β-, γ-, and δ-tocotrienols are sometimes
referred to collectively as "Vitamin E" in the popular press,
Vitamin E is properly defined chemically solely as
α-tocopherol. Of the various tocopherols present in foodstuff,
α-tocopherol is the most significant for human health both

WO 99/04622 PCT/US98/15137 

because it is the most bioactive of the tocopherols and also
because it is the tocopherol most readily absorbed and retained
by the body (Traber and Sies, Annu. Rev. Nutr. 16:321-347,
1996). The in vivo antioxidant activity of α-tocopherol is
higher than the antioxidant activities of β-, γ-, and
δ-tocopherol (Kamal-Eldin and Appelqvist, Lipids 31:671-701,
1996).

Only plants and certain other photosynthetic organisms,
including cyanobacteria, synthesize tocopherols. Therefore,
dietary tocopherols are obtained almost exclusively from
plants. Plant tissues vary considerably in total tocopherol
content and tocopherol composition. The predominant tocopherol
in green, photosynthetic plant tissues often is α-tocopherol.
Leaf tissue can contain from 10-50 μg total tocopherols/gram
fresh weight.

Non-green plant tissues and organs exhibit a wider range
of both total tocopherol levels and tocopherol compositions.
In general, most of the major food staple crops (e.g., rice,
corn, wheat, potato) produce low to extremely low levels of
total tocopherols, of which only a small percentage is
α-tocopherol (Hess, Vitamin E, α-tocopherol, In Antioxidants in
Higher Plants, R. Alscher and J. Hess, Eds. 1993, CRC Press,
Boca Raton, pp 111-134). Oil seed crops generally contain much
higher levels of total tocopherols; however, α-tocopherol is
present only as a minor component and β-, γ-, and δ-tocopherols
and tocotrienols predominate (Taylor and Barnes, Chemy Ind.,
Oct.:722-726, 1981).

Daily dietary intake of 15-30 mg of vitamin E is
recommended to obtain optimal plasma α-tocopherol levels. It
is quite difficult to achieve this level of vitamin E intake
from the average American diet. For example, one could obtain
the recommended daily dose of Vitamin E by daily consumption of
over 750 grams of spinach leaves (in which α-tocopherol
comprises 60% of total tocopherols) or 200-400 grams of soybean
oil.
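
By way of illustration only, the spinach figure can be checked with a back-of-envelope calculation using numbers stated above: leaf tissue containing 10-50 μg total tocopherols per gram fresh weight, of which 60% is α-tocopherol in spinach. The function name below is hypothetical, and the assumption that spinach lies near the upper end of the quoted leaf range is ours, not a measured value.

```python
def alpha_tocopherol_mg(grams_tissue, total_ug_per_g, alpha_fraction):
    """Alpha-tocopherol (mg) supplied by a given fresh weight of tissue."""
    return grams_tissue * total_ug_per_g * alpha_fraction / 1000.0  # ug -> mg

# Assumption: bounds taken from the 10-50 ug/g range quoted for leaf tissue.
low = alpha_tocopherol_mg(750, 10, 0.60)
high = alpha_tocopherol_mg(750, 50, 0.60)
print(f"750 g spinach: {low:.1f}-{high:.1f} mg alpha-tocopherol")
```

Under these assumptions, only tissue near the top of the range reaches the 15-30 mg recommendation at 750 grams, which is consistent with the statement that more than 750 grams would be needed.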

One alternative to relying on diet alone to obtain the
recommended levels of vitamin E is to take a vitamin E
supplement. However, most vitamin E supplements are synthetic
vitamin E having six stereoisomers, whereas natural vitamin E
is a single isomer. Furthermore, supplements tend to
be relatively expensive, and the general population is
disinclined to take vitamin supplements on a regular basis.

Although tocopherol function in plants has been less
extensively studied than tocopherol function in mammalian
systems, it is likely that the analogous functions performed by
tocopherols in animals also occur in plants. In general, plant
tocopherol levels have been found to increase with increases in
various stresses, especially oxidative stress. Increased
α-tocopherol levels in crops are associated with enhanced
stability and extended shelf life of fresh and processed plant
products (Peterson, Cereal Chem. 72(1):21-24, 1995; Ball,
Fat-soluble vitamin assays in food analysis. A comprehensive
review. London: Elsevier Science Publishers LTD, 1988).

Vitamin E supplementation of swine, beef, and poultry
feeds has been shown to significantly increase meat quality and
extend the shelf life of post-processed meat products by
retarding post-processing lipid oxidation, which contributes to
the formation of undesirable flavor components (Ball, supra
1988; Sante and Lacourt, J. Sci. Food Agric. 65(4):503-507,
1994; Buckley et al., J. of Animal Science 73:3122-3130, 1995).

What would be useful for the art is a method to increase
the ratio of α-tocopherol to γ-tocopherol in seeds, oils, and
leaves from crop and forage plants, or a method for producing
natural vitamin E in nonphotosynthetic bacteria or fungi using
a large scale fermentation process. Increasing α-tocopherol
levels in crop plants would increase the amount of α-tocopherol
obtained in the human diet, and would enhance the stability and
shelf life of plants and plant products. The meat industry
would benefit from the development of forage plants having
increased levels of vitamin E.



BRIEF SUMMARY OF THE INVENTION 
The present invention is based on an isolated DNA fragment
including a coding sequence for a γ-tocopherol
methyltransferase.

The invention is also a heterologous genetic construct
comprising a γ-tocopherol methyltransferase coding sequence
operably connected to a plant, bacterial, or fungal promoter
not natively associated with the γ-tocopherol methyltransferase
coding sequence.

Another aspect of the present invention is a method of
altering the tocopherol profile of a plant comprising the steps
of: (a) providing a heterologous genetic construct comprising a
γ-tocopherol methyltransferase coding sequence operably
connected to a plant promoter not natively associated with the
coding sequence; and (b) introducing the construct into the
genome of a plant.

The present invention is also directed toward transgenic
plants which have an altered ratio of α-tocopherol to
γ-tocopherol, thus increasing the nutritive value of the plants
and products therefrom for humans and animals.

In another embodiment, the invention is a plant comprising
in its genome a heterologous genetic construct comprising a
γ-tocopherol methyltransferase coding sequence operably connected
to a promoter that is functional in plants.

It is an object of the present invention to provide a
genetic construct comprising a coding sequence for a
γ-tocopherol methyltransferase operably connected to a plant
promoter not natively associated with the coding sequence which
when expressed in a plant comprising the construct in its
genome results in an alteration in the ratio of
α-tocopherol:γ-tocopherol in the plant, relative to an
untransformed wild-type plant.

It is an object of this invention to provide a plant
having an altered α-tocopherol:γ-tocopherol ratio.

Other objects, features, and advantages of the invention
will become apparent upon review of the specification and
claims.

BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWINGS
Figure 1 shows the alignment of amino acid sequences of
γ-tocopherol methyltransferases from Arabidopsis thaliana and
Synechocystis. Inverted triangles denote putative cleavage
sites of N-terminal targeting domains; the closed circle
denotes the position of an in-frame NcoI site in the leader
peptide of SLR0089.

DETAILED DESCRIPTION OF THE INVENTION 
The present invention is, in part, directed to a plant
comprising in its genome a genetic construct comprising a
γ-tocopherol methyltransferase coding sequence operably connected
to a plant promoter not natively associated with the coding
sequence. Such transgenic plants exhibit an altered
α-tocopherol:γ-tocopherol ratio relative to wild type plants of
the same species. In fact, seed and seed oil of a plant not
normally containing α-tocopherol can be altered so that the most
abundant tocopherol is α-tocopherol. Alternatively, the relative
percentage of γ-tocopherol present in plant tissue may be
increased by reducing the activity of γ-tocopherol
methyltransferase in the plant, which could be accomplished by
expression of a γ-tocopherol methyltransferase coding sequence in
the antisense orientation. The development of plants with
increased γ-tocopherol may be useful in certain industries.

Tocopherols and plastoquinones, the most abundant quinones
in plant plastids, are synthesized by a common pathway (Hess,
Antioxidants in Higher Plants, CRC Press: Boca Raton p 140-152,
1993; Soll, Plant Cell Membranes, Academic Press: San Diego p
383-392, 1987). The synthesis of tocopherols involves four
steps catalyzed by at least six enzymatic activities. A
branchpoint in the common pathway occurs upon phytylation or
prenylation of the precursor homogentisic acid to form either
2-methyl-6-phytylplastoquinol or
2-methyl-6-solanylplastoquinol, intermediates in tocopherol and
plastoquinone biosynthesis, respectively.

The intermediate 2-methyl-6-phytylplastoquinol is the
common precursor to the biosynthesis of all tocopherols. In
spinach leaves, the intermediate undergoes ring methylation to
yield 2,3-dimethyl-6-phytylplastoquinol, which is cyclized to
form γ-tocopherol. A second ring methylation at position 5
yields α-tocopherol (Soll and Schultz, Phytochemistry
19(2):215-218, 1980). The second ring methylation is catalyzed
by γ-tocopherol methyltransferase, a distinct enzymatic
activity from the methyltransferase that catalyzes the
methylation at position 7, and the only enzyme of the pathway
that has been purified from plants (d'Harlingue and Camara,
J. Biol. Chem. 260(68):15200-15203, 1985; Ishiko et al.,
Phytochemistry 31(5):1499-1500, 1992).

The methylation enzymes are involved in regulating the
final composition of the tocopherol pool. Data obtained in
studies of sunflower mutants suggest that the enzymes involved
in methylation have a high degree of influence over relative
tocopherol amounts but do not affect the overall regulation of
total tocopherol content (Demurin, Helia 16:59-62, 1993).
Normally, seed tocopherol composition in cultivated sunflower
(Helianthus annuus L.) is primarily α-tocopherol (i.e., 95-100%
of the total tocopherol pool) (Skoric et al., Proceedings of
the 14th International Sunflower Conference, 1996,
Beijing/Shenyang, China). However, two mutant sunflower lines
were identified with tocopherol compositions of 95%
γ-tocopherol/5% α-tocopherol and 50% β-tocopherol/50%
α-tocopherol. Although these presumed tocopherol methylation
mutants were found to have dramatically different tocopherol
profiles in seed, total tocopherol levels were not
significantly different than those of wild type sunflower
(Demurin, supra 1993). Based on these results, we hypothesized
that it should be possible to alter the tocopherol profile of
many plant species by manipulating γ-tocopherol
methyltransferase expression without affecting the total
tocopherol pool size.

The enzyme γ-tocopherol methyltransferase catalyzes the
methylation of γ-tocopherol to form α-tocopherol, the final
step in α-tocopherol biosynthesis. Overexpression of a
γ-tocopherol methyltransferase gene in a plant enhanced the
conversion of γ-tocopherol to α-tocopherol in any tissue
containing γ-tocopherol, thereby increasing the
α-tocopherol:γ-tocopherol ratio. In fact, seed and oil in which
little or no α-tocopherol is found can be altered to contain
predominantly α-tocopherol. Conversely, expression of the
antisense RNA would be expected to reduce expression of the
γ-tocopherol methyltransferase, causing a decrease in the
α-tocopherol:γ-tocopherol ratio. Plants having increased
γ-tocopherol may be useful for certain industries.

We have discovered that γ-tocopherol methyltransferase
also catalyzes the conversion of δ-tocopherol to β-tocopherol.
Overexpression of γ-tocopherol methyltransferase in plant
tissue results in increased conversion of δ-tocopherol to
β-tocopherol. It is expected that expression of γ-tocopherol
methyltransferase antisense RNA would result in reduced
conversion of δ-tocopherol to β-tocopherol.
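
By way of illustration only, the expected effect of γ-tocopherol methyltransferase activity on a tocopherol profile can be sketched as a simple bookkeeping model: γ-tocopherol is converted to α-tocopherol and δ-tocopherol to β-tocopherol, while the total pool is left unchanged. The function name, the single shared conversion fraction, and the example profile are all hypothetical simplifications and are not measured data.

```python
def apply_gamma_tmt(profile, conversion):
    """Shift a tocopherol profile as gamma-TMT activity would.

    profile: dict with keys 'alpha', 'beta', 'gamma', 'delta' (any units).
    conversion: fraction (0-1) of gamma and delta substrate methylated.
    Assumption: the same fraction applies to both substrates.
    """
    if not 0.0 <= conversion <= 1.0:
        raise ValueError("conversion must be between 0 and 1")
    out = dict(profile)
    moved_gamma = profile["gamma"] * conversion  # gamma -> alpha
    moved_delta = profile["delta"] * conversion  # delta -> beta
    out["gamma"] -= moved_gamma
    out["alpha"] += moved_gamma
    out["delta"] -= moved_delta
    out["beta"] += moved_delta
    return out

# Hypothetical seed profile (percent of total), not measured data.
seed = {"alpha": 5.0, "beta": 0.0, "gamma": 90.0, "delta": 5.0}
shifted = apply_gamma_tmt(seed, 0.9)
print(shifted)
```

A conversion fraction near zero would correspond to the antisense case, in which the γ- and δ-forms are expected to remain predominant.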

As demonstrated in the examples below, the seed of
Arabidopsis plants transformed with a genetic construct
comprising an Arabidopsis γ-tocopherol methyltransferase gene
under the control of either a seed-specific promoter or the
constitutive cauliflower mosaic virus 35S promoter exhibits a
dramatic increase in the ratio of α-tocopherol:γ-tocopherol.
No α-tocopherol is detected in the seed of untransformed
Arabidopsis, whereas seed from Arabidopsis transformed with the
γ-tocopherol methyltransferase gene under the control of the
seed-specific promoter contained about 90% α-tocopherol. Seed
from Arabidopsis transformed with the γ-tocopherol
methyltransferase gene under the control of a constitutive
promoter contained slightly less α-tocopherol (84%). This
observation demonstrates that, for plants natively having a
tocopherol profile in which α-tocopherol is not predominant
(i.e. is less than 50% of total tocopherol), α-tocopherol
can be made to be the predominant tocopherol form in seed or
seed oil from a transgenic plant.

Methylation of γ-tocopherol to form α-tocopherol is the
means by which the ratio of the di-methylated tocopherols
(γ-tocopherol) and tri-methylated tocopherol (α-tocopherol) is
regulated. By upregulating γ-tocopherol methyltransferase
expression in tissues in which it is not normally expressed in
a plant, it is now possible to increase α-tocopherol levels in
tissues of many agricultural crops in which γ-tocopherol is a
major tocopherol (e.g., maize, soybean, rapeseed, cotton,
peanut, safflower, castor bean, rice). Many common edible seed
oils have large amounts of γ-tocopherol. Increasing the level
of expression of γ-tocopherol methyltransferase in seed oil
plants should increase the ratio of α-tocopherol:γ-tocopherol.

Isolation and functional analysis of the γ-tocopherol
methyltransferase genes from Synechocystis PCC6803 and
Arabidopsis thaliana were accomplished by concurrently pursuing
the complementary molecular genetic approaches described in
detail in the examples. These two model organisms were
selected because both synthesize tocopherols by similar or
identical pathways and both are highly tractable genetic,
molecular, and biochemical systems.

The DNA sequences of the γ-tocopherol methyltransferase
genes from Synechocystis PCC6803 and Arabidopsis thaliana are
shown in SEQ ID NO:1 and SEQ ID NO:3, respectively. The
corresponding deduced amino acid sequences of the proteins are
shown in SEQ ID NO:2 and SEQ ID NO:4.

It is expected that the present invention may be practiced
using a γ-tocopherol methyltransferase gene from any
photosynthetic organism. It is well within the ability of one
of skill in the art to isolate a plant γ-tocopherol
methyltransferase gene using the sequences disclosed herein.
The usefulness of these sequences to identify other
γ-tocopherol methyltransferase coding sequences is demonstrated
by the fact that it was the Synechocystis sequence that was
used to identify the Arabidopsis sequence. The two sequences
can be used to screen public computer databases of plant cDNAs
(dbEST databases) and genomic sequences. Alternatively, the
sequences could be used to design probes for use in identifying
genomic or cDNA clones containing a γ-tocopherol
methyltransferase sequence. Another approach would be to use
the sequences to design oligonucleotide primers for use in PCR
amplification of γ-tocopherol methyltransferase genes from
plant DNA.

To determine whether one has identified a γ-tocopherol
methyltransferase sequence, one could perform a gene
replacement study using wild type Synechocystis, a
complementation study using a Synechocystis γ-TMT knockout
mutant, or an in vitro enzyme assay using a suitable substrate
and γ-tocopherol methyltransferase protein expressed in E. coli
or another suitable expression system. A genetic construct
comprising the γ-tocopherol methyltransferase coding sequence
operably connected to a plant promoter can be constructed and
used to transform Arabidopsis or a plant or crop plant of
interest. A transgenic plant comprising the construct in its
genome would be expected to have altered expression of
γ-tocopherol methyltransferase and an altered tocopherol profile
relative to an untransformed, wild-type plant.

It is expected that polyploid plants having more than one
copy of the γ-tocopherol methyltransferase gene may have
allelic variations among γ-tocopherol methyltransferase gene
sequences. It is anticipated that putative γ-tocopherol
methyltransferase gene sequences having less than 100% homology
to SEQ ID NO:1 or SEQ ID NO:3 encode proteins having
γ-tocopherol methyltransferase activity.

It is envisioned that minor sequence variations from SEQ
ID NO:1 or SEQ ID NO:3 associated with nucleotide additions,
deletions, and mutations, whether naturally occurring or
introduced in vitro, will not affect γ-tocopherol
methyltransferase activity. The scope of the present invention
is intended to encompass minor variations in γ-tocopherol
methyltransferase sequences. Also, it is now well within the
level of ordinary skill in the art of plant genetic engineering
to alter the coding sequence for a gene by changing codons
specifying common amino acids or by making conservative
amino acid substitutions at DNA sequences encoding non-critical
portions of enzymes.
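
By way of illustration only, the distinction between a codon change that leaves the encoded protein unchanged and a conservative amino acid substitution can be sketched as follows. The DNA fragments are hypothetical and are not taken from SEQ ID NO:1 or SEQ ID NO:3; the codon table is deliberately partial and covers only the codons used here.

```python
# Partial standard codon table: only the codons used in this illustration.
CODON_TABLE = {
    "ATG": "M", "GCT": "A", "GCC": "A", "CTG": "L", "TTA": "L",
    "AAA": "K", "AAG": "K", "GAT": "D", "GAA": "E",
}

def translate(dna):
    """Translate an in-frame DNA string using the partial table above."""
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    return "".join(CODON_TABLE[c] for c in codons)

original = "ATGGCTCTGAAAGAT"      # hypothetical fragment encoding M A L K D
synonymous = "ATGGCCTTAAAGGAT"    # different codons, same amino acids
conservative = "ATGGCTCTGAAAGAA"  # Asp -> Glu, a conservative substitution

print(translate(original), translate(synonymous), translate(conservative))
```

The first two fragments differ at the nucleotide level but encode the same peptide, whereas the third replaces one acidic residue with another.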

Construction of an expression vector comprising a
γ-tocopherol methyltransferase coding sequence operably connected
to a plant promoter not natively associated with the coding
sequence will be achieved using standard molecular biology
techniques known to the art. The plant promoter may be a
tissue-specific promoter such as a seed-specific promoter
(e.g., napin or DC3), a constitutive promoter such as CaMV 35S,
a developmental stage-specific promoter, or an inducible
promoter. Promoters may also contain certain enhancer sequence
elements that improve efficiency of transcription. Optionally,
the construct may contain a termination signal, such as the
nopaline synthase terminator (NOS). Preferably, the constructs
will include a selectable or screenable marker to facilitate
identification of transformants. The constructs may have the
coding region in the sense or antisense orientation.

Once a genetic construct comprising a γ-tocopherol
methyltransferase gene has been obtained, it can readily be
introduced into a plant or plant tissue using standard methods
known to the art. For example, the Agrobacterium
transformation system is known to work well with all dicot
plants and some monocots. Other methods of transformation
equally useful in dicots and monocots may also be used.
Transgenic plants may be obtained by particle bombardment,
electroporation, or by any other method of transformation known
to one skilled in the art of plant molecular biology. The
experience to date in the technology of plant genetic
engineering has taught that the method of gene introduction
does not affect the phenotype achieved in the transgenic
plants.

A transgenic plant may be obtained directly by
transformation of a plant cell in culture, followed by
regeneration of a plant. More practically, transgenic plants
may be obtained from transgenic seeds set by parental
transgenic plants. Transgenic plants pass on inserted genes,
sometimes referred to as transgenes, to their progeny by normal
Mendelian inheritance just as they do their native genes.
Methods for breeding and regenerating plants of agronomic
interest are known to the art. Experience with transgenic
plants has also demonstrated that the inserted gene, or
transgene, can be readily transferred by conventional plant
breeding techniques into any desired genetic background.

It is reasonable to expect that the expression of
heterologous γ-tocopherol methyltransferase in a transgenic
plant will result in alterations in the tocopherol profile in
that plant. In addition to the inherent advantage of
increasing the α-tocopherol:γ-tocopherol ratio, changes in the
tocopherol profile may result in unique, advantageous
phenotypes. This invention is intended to encompass other
advantageous phenotypes that may result from alterations in
tocopherol biosynthesis in plants obtained by the practice of
this invention.

Using the information disclosed in this application and
standard methods known to the art, one of skill in the art
could practice this invention using any crop plant or forage
plant of interest.

The following nonlimiting examples are intended to be 
purely illustrative. 

EXAMPLES 

Example 1. Identification and Characterization of a
Putative γ-TMT Gene in Synechocystis PCC6803
We recently cloned and characterized the γ-tocopherol
methyltransferase gene from Synechocystis as follows. An
Arabidopsis p-hydroxyphenylpyruvic acid dehydrogenase
(HPPDase) cDNA sequence (Norris and Della Penna, submitted,
Genbank Accession # AF000228, Plant Physiol., in press) was
used to search a database containing the DNA sequence of the
Synechocystis PCC6803 genome (Kaneko et al., DNA Res. 3:109-136,
1996). We identified an open reading frame (designated SLR0090)
that shares a high degree of amino acid sequence similarity
(i.e. 35% identity and 61% similarity) with the Arabidopsis
HPPDase enzyme. The putative Synechocystis HPPDase gene is
located within an operon in the Synechocystis genome comprising
10 open reading frames (ORFs) encompassing bases 2,893,184
to 2,905,235 of the published Synechocystis PCC6803 genome
(Kaneko et al., supra 1996). We hypothesized that this operon
might also contain additional genes that encode other enzymes
involved in tocopherol synthesis.

Two ORFs (SLR0089 and SLR0095) were identified as possible
candidates for Synechocystis tocopherol methyltransferase
genes. BLAST searches with ORFs SLR0089 and SLR0095 showed
that these proteins share a high degree of similarity to the
known protein sequences of Δ-(24)-sterol-C-methyltransferases
and various plant caffeoyl CoA-O-methyltransferases,
respectively. Both SLR0089 and SLR0095 proteins contain
consensus sequences corresponding to conserved
S-adenosylmethionine (SAM) binding domains (Kagan and Clarke,
Archives of Biochem. and Biophy. 310(2):417-427, 1996). The
SLR0089 protein contains other structural features that are
consistent with features found in a tocopherol
methyltransferase. These features were not found in SLR0095.
First, PSORT (Prediction of Protein Localization Sites) computer
analysis of the two protein sequences predicts that SLR0089 is
localized to the plasma membrane, whereas SLR0095 is localized
to the cytosol. Tocopherol biosynthesis in cyanobacteria is
believed to occur in the plasma membrane; therefore,
localization of SLR0089 protein to the plasma membrane suggests
that it may be a tocopherol methyltransferase. Additionally,
PSORT analysis identified the presence of a putative bacterial
signal sequence in the first 25 amino acids of the SLR0089
protein. The predicted molecular weight of the mature SLR0089
protein (after truncation of the signal sequence) is 32,766
daltons, which is very close to the reported molecular weight
(33,000 daltons) of the γ-tocopherol methyltransferase purified
from pepper fruits (d'Harlingue and Camara, supra 1985). The
predicted molecular weight of SLR0095 is 24,322 daltons.
Therefore, we concluded that of the two identified ORFs, the
SLR0089 gene was more likely to be a tocopherol
methyltransferase.
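
By way of illustration only, a predicted molecular weight for a mature protein of the kind discussed above can be approximated by summing average residue masses after removing the signal sequence. The sketch below is not the method used to obtain the 32,766 dalton figure, which is not described here; the function names and the toy sequence are hypothetical, and the residue masses are rounded approximations.

```python
# Approximate average residue masses in daltons (amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.05, "A": 71.08, "S": 87.08, "P": 97.12, "V": 99.13,
    "T": 101.10, "C": 103.14, "L": 113.16, "I": 113.16, "N": 114.10,
    "D": 115.09, "Q": 128.13, "K": 128.17, "E": 129.12, "M": 131.19,
    "H": 137.14, "F": 147.18, "R": 156.19, "Y": 163.18, "W": 186.21,
}
WATER = 18.02  # one water is added back for the free N- and C-termini

def protein_mass(seq):
    """Approximate average molecular weight (Da) of an unmodified polypeptide."""
    return sum(RESIDUE_MASS[aa] for aa in seq) + WATER

def mature_mass(seq, signal_len):
    """Mass after removing the first signal_len residues (the signal sequence)."""
    return protein_mass(seq[signal_len:])

# Hypothetical toy sequence, not SEQ ID NO:2.
toy = "MKRLLAGASTVDE"
print(round(protein_mass(toy), 1), round(mature_mass(toy, 5), 1))
```

For a real protein, `signal_len` would be set to the length of the predicted signal sequence, which is 25 residues for SLR0089 according to the PSORT analysis described above.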

Example 2. Amplification and cloning of the Synechocystis
γ-TMT gene

Synechocystis genomic DNA was isolated by the method of
Williams (Methods Enzymol. 167:776-778, 1987). The SLR0089
gene was amplified from Synechocystis genomic DNA by polymerase
chain reaction (PCR) using a sense strand specific
primer (SLR0089F, SEQ ID NO:5) and a non-sense strand specific
primer (SLR0089R, SEQ ID NO:6) under the following conditions.
The amplification of the SLR0089 open reading frame was
conducted in a 50 μl reaction volume containing 0.4 mM dATP, 0.4
mM dGTP, 0.4 mM dCTP, 0.4 mM dTTP, 0.2 μM SLR0089F
primer, 0.2 μM SLR0089R primer, 10 ng Synechocystis PCC6803
genomic DNA, 20 mM Tris-HCl (pH 8.4), 50 mM KCl, 2 mM MgCl2,
and 2.5 units Taq polymerase (Gibco-BRL). PCR thermocycle
conditions were as follows:

5 minutes 95°C (1 cycle)
1 minute 95°C -> 1 minute 55°C -> 1.5 minutes 72°C (35 cycles)
7 minutes 72°C (1 cycle)
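
By way of illustration only, the thermocycle program and reaction quantities stated above can be written down as data and checked arithmetically. The variable and function names are hypothetical; the times, cycle counts, primer concentration, and reaction volume are those given in the text. Ramp times between temperatures are ignored.

```python
# Thermocycle program as (label, minutes per cycle, cycles); values from the text.
PROGRAM = [
    ("initial denaturation, 95 C", 5.0, 1),
    ("95 C 1 min / 55 C 1 min / 72 C 1.5 min", 1.0 + 1.0 + 1.5, 35),
    ("final extension, 72 C", 7.0, 1),
]

def hold_time_minutes(program):
    """Sum of programmed hold times; ramping between temperatures is ignored."""
    return sum(minutes * cycles for _, minutes, cycles in program)

def picomoles(conc_um, volume_ul):
    """Amount in pmol for a concentration in uM and a volume in ul."""
    return conc_um * volume_ul  # (umol/L) * uL = pmol

total = hold_time_minutes(PROGRAM)  # 5 + 35 * 3.5 + 7 = 134.5 minutes
primer_pmol = picomoles(0.2, 50)    # about 10 pmol of each primer per reaction
print(total, primer_pmol)
```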

The PCR product comprising the SLR0089 ORF was cloned
using standard molecular biological techniques known to one of
skill in the art. Briefly, the amplified SLR0089 ORF was
purified and made blunt ended by treatment with the Klenow
fragment. The SLR0089 gene was ligated to EcoRV-linearized
pBluescript KS II (Stratagene, Inc., La Jolla, CA). The
ligation mixture was used to transform competent E. coli DH5α
cells, and putative transformants were selected on the basis of
ampicillin resistance. A plasmid designated pH-1 that was
isolated from a transformant was found to contain the SLR0089
insert. The identity of the SLR0089 gene (SEQ ID NO:1) was
confirmed by sequencing using T7 and T3 sequencing primers.

Example 3. Development of a SLR0089 knockout mutant

A gene replacement vector was constructed using standard
molecular biology techniques. The plasmid pH1, which contains
a unique NcoI site in the SLR0089 ORF, was digested with NcoI
restriction endonuclease. The aminoglycoside
3'-phosphotransferase gene from Tn903 was ligated to the NcoI
site of pH1 and the ligation mixture was used to transform
E. coli DH5α cells. Transformants were selected using kanamycin
and ampicillin. A recombinant plasmid (pQ-1) containing the
disrupted SLR0089 ORF was isolated and used to transform
Synechocystis PCC6803 according to the method of Williams
(Methods Enzymol. 167:776-778, 1987).

Synechocystis transformants were selected on BG-11
medium (Castenholz, Methods in Enzymology p 68-93, 1988)
containing 15 mM glucose and 15 μg/ml kanamycin. All cultures
were grown under continuous light at 26°C. Four independent
transformants were carried through five subculturings of single
colonies to fresh medium. PCR and genomic analysis were used
to confirm that the gene replacement was successful and
complete.

Example 4. Tocopherol profiles of wild type and mutant
Synechocystis

Approximately 200 mg of cells were scraped from 2 week old
Synechocystis cultures grown on BG-11 agar medium. The cells
were homogenized in 6 ml of 2:1 (volume:volume) methanol:CHCl3
containing 1 mg/ml butylated hydroxytoluene (BHT) using a
polytron homogenizer. Following homogenization, 2 ml of CHCl3
and 3.4 ml of double-distilled water were added to the
homogenate. The lower lipid phase was removed and dried under
nitrogen gas. The dried lipids were resuspended in 200 μl of
HPLC grade ethyl acetate containing 1 mg/ml BHT.
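
By way of illustration only, the solvent composition at the phase-separation step can be worked out from the volumes stated above. The function name is hypothetical, and the calculation assumes that volumes are simply additive and ignores the volume contributed by the cells.

```python
def split_by_ratio(total_ml, parts):
    """Split a volume among components given volume:volume parts."""
    whole = sum(parts.values())
    return {name: total_ml * p / whole for name, p in parts.items()}

# 6 ml of 2:1 methanol:CHCl3 used for homogenization.
mix = split_by_ratio(6.0, {"methanol": 2, "chloroform": 1})

# Additions made after homogenization.
mix["chloroform"] += 2.0
mix["water"] = 3.4

total_ml = sum(mix.values())
print(mix, total_ml)
```

Under these assumptions the final mixture contains equal volumes of methanol and chloroform (4 ml each) and 3.4 ml of water.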

Tocopherols were analyzed by reverse phase HPLC using a
Hewlett-Packard Series 1100 HPLC system with a fluorescence
detector. Crude lipid extracts were fractionated on a Waters
Spherisorb S5 ODS2 4.6 x 250 mm column in a mobile phase
consisting of 75% methanol and 25% isopropanol at a flow rate
of 1 ml/min. The fluorescence was measured at 330 nm after
excitation at a wavelength of 290 nm.

Wild-type Synechocystis produces α-tocopherol as its most
abundant tocopherol (>95% of total tocopherols). The
SLR0089-disrupted mutant of Synechocystis is no longer able to
synthesize α-tocopherol and instead accumulates γ-tocopherol as
its sole tocopherol. The elimination of α-tocopherol
production and concomitant accumulation of γ-tocopherol
conclusively demonstrates that SLR0089 encodes γ-tocopherol
methyltransferase, which catalyzes the final step in
α-tocopherol biosynthesis.

Example 5. Identification of a Putative Arabidopsis γ-TMT
cDNA from the EST Database
The Arabidopsis EST database (Ausubel et al., Current
Protocols in Molecular Biology, Greene Publishing and
Wiley-Interscience, N.Y., 1987) was searched using the
Synechocystis γ-TMT DNA and protein sequences as queries. Two
cDNA clones that share significant homology with the
Synechocystis sequence were identified: the Arabidopsis
Δ-(24)-sterol-C-methyltransferase and the Arabidopsis expressed
sequence tag (EST) clone 165H5T7. Because the
Δ-(24)-sterol-C-methyltransferase was functionally identified
by its ability to complement a yeast
Δ-(24)-sterol-C-methyltransferase mutant (erg6), we are
confident that the clone does not encode a γ-TMT (Husselstein
et al., FEBS Letters 381:87-92, 1996). Therefore, we decided
to focus our efforts on the Arabidopsis 165H5T7 EST clone
(Genbank Accession #R30539). The DNA sequence of the 165H5T7
EST clone was determined (SEQ ID NO:3) and the amino acid
sequence of the putative protein was deduced. The sequence
was aligned with that of the Synechocystis γ-TMT (Fig. 1).
The full-length 165H5T7 clone encodes a protein that is 35%
identical and 66% similar to the Synechocystis γ-TMT and
exhibits large blocks of identity. When 165H5T7 was used as
query against the non-repetitive protein database, it was found
to have the highest homology to SLR0089 (P < 10^-54) and only
moderate homology to the four known plant
Δ-(24)-sterol-C-methyltransferases (P > 10^-5). 165H5T7 also
contains conserved SAM binding motifs common to a large number
of methyltransferases (Fig. 1) but lacks the proposed sterol
binding domains common to the four plant
Δ-(24)-sterol-C-methyltransferases identified to date
(Husselstein et al., supra, 1996). These data suggest that
clone 165H5T7 encodes an Arabidopsis γ-TMT homologue, which we
have designated A.t.γ-TMT.
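The percent-identity figure quoted above comes from a pairwise alignment. As a minimal sketch of how such a figure can be computed, the function below counts identical residues over the aligned, non-gap columns. The two short aligned strings are hypothetical and are not taken from Fig. 1; alignment programs also differ in which columns they count in the denominator, so this is one common convention rather than the method used for the reported 35%.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of aligned (non-gap) columns holding identical residues."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Keep only columns where neither sequence has a gap character.
    pairs = [(a, b) for a, b in zip(aligned_a, aligned_b)
             if a != "-" and b != "-"]
    if not pairs:
        return 0.0
    matches = sum(1 for a, b in pairs if a == b)
    return 100.0 * matches / len(pairs)


# Hypothetical 10-column alignment, for illustration only.
toy_a = "MKAL-GCGIG"
toy_b = "MRALDGCG-G"
print(percent_identity(toy_a, toy_b))
```

A similarity score such as the 66% reported above would additionally count conservative substitutions, which requires a substitution matrix and is not attempted here.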
Example 6. Characterization of the putative Arabidopsis γ-TMT
           homologue using gene replacement in Synechocystis

Plant cDNAs encoding putative γ-TMT homologues may be
functionally identified using one of two gene replacement
approaches in Synechocystis. One approach that may be employed
is to replace the endogenous Synechocystis γ-TMT gene in wild
type Synechocystis with the putative Arabidopsis γ-TMT cDNA
165H5T7. A Synechocystis γ-TMT (coding sequence # SLR0089) gene
replacement vector will be constructed to include the following
features, in 5' to 3' order: 1) at least 300 base pairs of DNA
sequence corresponding to the Synechocystis genomic sequence
found immediately upstream (5') of the native SLR0089 gene; 2)
the first 77 base pairs of the SLR0089 ORF corresponding to the
identified bacterial signal sequence that ends with a unique,
in-frame NcoI site; 3) a polylinker or multiple cloning site;
4) an antibiotic resistance marker (e.g., a kanamycin
resistance gene cassette); and 5) at least 300 base pairs of
DNA sequence corresponding to the Synechocystis genomic
sequence found immediately downstream (3') of the native
SLR0089 gene. The putative plant γ-TMT cDNA to be tested for
complementation will be inserted into the NcoI site or into the
multiple cloning site.
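The in-frame NcoI site near the 5' end of the SLR0089 ORF can be located directly in the sequence. The sketch below searches the first 96 nucleotides of SEQ ID NO:1 (as given in the sequence listing) for the NcoI recognition sequence CCATGG and checks whether the ATG inside the site falls on a codon boundary. The helper names are illustrative only, and only this 5' region is searched.

```python
NCOI_SITE = "CCATGG"  # NcoI recognition sequence

# First 96 nucleotides of the SLR0089 coding sequence (SEQ ID NO:1).
SLR0089_5PRIME = (
    "ATGGTTTACCATGTTAGGCCTAAGCACGCCCTGTTCTTAGCATTCTAT"
    "TGTTATTTCTCTTTGCTTACCATGGCCAGCGCCACCATTGCCAGTGCA"
)


def find_sites(seq: str, site: str) -> list:
    """Return 0-based start positions of every occurrence of site in seq."""
    hits = []
    start = seq.find(site)
    while start != -1:
        hits.append(start)
        start = seq.find(site, start + 1)
    return hits


def atg_in_frame(site_start: int) -> bool:
    """True if the ATG inside CCATGG (offset +2) begins a codon of the ORF."""
    return (site_start + 2) % 3 == 0


hits = find_sites(SLR0089_5PRIME, NCOI_SITE)
for pos in hits:
    codon_number = (pos + 2) // 3 + 1
    print(pos, atg_in_frame(pos), codon_number)
```

In this region the site occurs once, and its ATG is the codon for the methionine at position 24 of the deduced protein.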

The 165H5T7 cDNA may be engineered to contain an NcoI site
at the transit peptide cleavage site predicted by PSORT using
PCR mutagenesis, which would change the amino acid Val-48 to
Met. The cDNA will be ligated to the unique NcoI site in the
SLR0089 gene replacement plasmid to create an in-frame,
amino-terminal fusion between the Synechocystis γ-TMT signal
peptide and the plant protein sequence. The construct will be
used to transform wild type Synechocystis; transformants will
be identified by kanamycin selection. After several single
colony passages under selection, gene replacement will be
confirmed by PCR. The tocopherol profile of transformants will
be determined by HPLC. Synechocystis transformants
functionally expressing Arabidopsis γ-TMT genes will be
identified by their ability to synthesize α-tocopherol in the
absence of a functional Synechocystis γ-TMT gene.

In an alternative approach, the putative γ-TMT gene may be
characterized according to its ability to complement the
Synechocystis γ-TMT knockout mutant. The replacement vector
could be constructed to include the intact putative γ-TMT gene
and an antibiotic resistance marker other than kanamycin.
Following transformation and selection, gene replacement can be
confirmed by PCR and the transformants may be further
characterized by tocopherol analysis.

Example 7. Functional characterization of Arabidopsis and
           Synechocystis γ-TMT genes by expression in E. coli

The proteins encoded by the Synechocystis SLR0089 gene and
the Arabidopsis 165H5T7 cDNA clone were identified as γ-TMTs
through functional expression in E. coli.

The SLR0089 gene was amplified from the Synechocystis
PCC6803 genome using polymerase chain reaction (PCR). The
forward primer (SLR0089coliF, SEQ ID NO:7) was designed to add
a BspHI site to the 5' end of the primer. The reverse (3') PCR
primer (SLR0089coliR, SEQ ID NO:8) was designed with a BglII
site engineered at the 5' end of the primer.

The PCR reaction was conducted in two 100-µl reaction
mixtures, each of which contained dNTPs (0.4 mM each), 2 µM
SLR0089coliF, 2 µM SLR0089coliR, 10 ng Synechocystis PCC6803
genomic DNA, 10 mM KCl, 6.0 mM (NH4)2SO4, 20 mM Tris-HCl (pH
8.2), 2 mM MgCl2, 0.1% Triton X-100, 10 µg/ml BSA, and 2.5
units Pfu polymerase (Stratagene, LaJolla, CA). The following
thermocycle conditions were used:

    5 minutes 95°C (1 cycle)
    0.75 minutes 94°C -> 0.75 minutes 55°C -> 2 minutes 72°C (30 cycles)
    10 minutes 72°C (1 cycle)
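The thermocycle conditions above can be written down as a small data structure, which makes the total programmed hold time easy to check. This is only a sketch: the `Stage` type and `hold_minutes` function are illustrative names, and ramp times between temperatures are ignored.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Stage:
    steps: tuple   # (minutes, degrees C) pairs, run in order
    cycles: int    # number of times the steps are repeated


# The three stages listed in the thermocycle conditions.
PROGRAM = (
    Stage(steps=((5.0, 95),), cycles=1),
    Stage(steps=((0.75, 94), (0.75, 55), (2.0, 72)), cycles=30),
    Stage(steps=((10.0, 72),), cycles=1),
)


def hold_minutes(program) -> float:
    """Sum of programmed hold times; temperature ramps are not included."""
    return sum(stage.cycles * sum(minutes for minutes, _ in stage.steps)
               for stage in program)


print(hold_minutes(PROGRAM))
```

The programmed holds add up to two hours: 5 minutes of initial denaturation, 30 cycles of 3.5 minutes, and a 10-minute final extension.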
The PCR fragment was gel-purified and ligated to
EcoRV-linearized pBluescript KS II (Stratagene, LaJolla, CA).
The ligation product was used to transform E. coli strain DH5α,
and putative transformants were selected on the basis of
ampicillin resistance. A recombinant plasmid containing the
insert (designated p082297) was sequenced to confirm the
correct amplification and subcloning of the SLR0089 sequence.

The deduced amino acid sequence of SLR0089 contains a
putative amino-terminal bacterial signal sequence comprising
the first 24 amino acids of the deduced amino acid sequence.
Because this amino-terminal signal sequence could affect the
conformation of the SLR0089 protein when expressed in E. coli
and render the protein inactive, we modified the SLR0089 DNA
sequence such that it encodes a truncated protein devoid of the
putative amino-terminal bacterial signal sequence. The SLR0089
gene contains an NcoI recognition sequence at the predicted
cleavage site for the putative bacterial signal sequence. An
NcoI-BglII fragment containing a truncated SLR0089 DNA sequence
from p082297-coli was subcloned in the correct reading frame
into the NcoI and BamHI sites of the T7 E. coli pET3D
expression vector (Novagen, Madison, WI). The ligation mixture
was used to transform E. coli BL21 (DE3) and transformants were
selected on the basis of ampicillin resistance. A plasmid
(designated p011698-1) containing the insert was identified by
restriction digest analysis with the enzyme HindIII.
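Cloning an NcoI-BglII fragment into NcoI and BamHI sites works because BglII and BamHI leave the same single-stranded overhang, even though their recognition sequences differ. The sketch below derives the 5' overhangs from the recognition sequences and cut positions of the four enzymes named in this example. The dictionary layout and function names are illustrative, and the overhang calculation assumes a palindromic site cut symmetrically on both strands.

```python
# Recognition sequence and number of bases before the top-strand cut.
ENZYMES = {
    "NcoI": ("CCATGG", 1),   # C^CATGG
    "BspHI": ("TCATGA", 1),  # T^CATGA
    "BglII": ("AGATCT", 1),  # A^GATCT
    "BamHI": ("GGATCC", 1),  # G^GATCC
}


def overhang(enzyme: str) -> str:
    """5' overhang left by a palindromic site that is cut symmetrically."""
    site, cut = ENZYMES[enzyme]
    return site[cut:len(site) - cut]


def compatible(a: str, b: str) -> bool:
    """True if the two enzymes leave identical cohesive ends."""
    return overhang(a) == overhang(b)


for pair in (("BglII", "BamHI"), ("BspHI", "NcoI"), ("NcoI", "BamHI")):
    print(pair, overhang(pair[0]), overhang(pair[1]), compatible(*pair))
```

Ligating a BglII end to a BamHI end joins the fragments but regenerates neither site, so the junction cannot be recut with either enzyme.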

The 165H5T7 cDNA clone was also subcloned into the pET3D
expression vector. The first 50 N-terminal amino acids of the
deduced amino acid sequence of 165H5T7 contain a putative
amino-terminal chloroplast targeting sequence that could affect
the conformation of the 165H5T7 protein when expressed in E.
coli and render the protein inactive. Therefore, we modified
the 165H5T7 DNA sequence to encode a truncated protein devoid
of the putative amino-terminal chloroplast targeting sequence.
The truncated 165H5T7 DNA sequence was obtained by PCR
amplification of 165H5T7 cDNA using primers designed to amplify
the sequence corresponding to the region between nucleotide 353
and nucleotide 1790 of the original 165H5T7 sequence. The
forward PCR primer (165matF, SEQ ID NO:9) adds an NcoI site to
the 5' end of the truncated 165H5T7 sequence to facilitate
cloning into the pET3D vector. The reverse (3') PCR primer
(165matR, SEQ ID NO:10) was designed from the polylinker region
of the pSPORT1 vector with an AccI site engineered at the 5' end
of the primer. The PCR reaction was conducted with the 165matF
and 165matR primers (2 µM each) using the same PCR conditions
described for the amplification of the truncated Synechocystis
gene, above.

Following gel purification, the PCR fragment was ligated
to EcoRV-linearized pBluescript KS II, the ligation product was
used to transform E. coli strain DH5α, and ampicillin-resistant
putative transformants were selected. A recombinant plasmid
(designated p010498-2) containing the insert was identified.
The DNA sequence of p010498-2 was determined to confirm the
correct amplification and subcloning of the truncated 165H5T7
sequence. The truncated 165H5T7 DNA sequence was subcloned as
an NcoI-BamHI fragment into pET3D vector digested with NcoI and
BamHI. The ligation product was used to transform E. coli
DH5α and transformants were selected on the basis of
ampicillin resistance. A plasmid (designated p011898-1)
containing the insert was identified by restriction digest
analysis with the enzyme HindIII.

The p011698-1 and p011898-1 constructs were used to
transform the E. coli T7 expression host BL21 (DE3). To
generate protein for γ-TMT assays, one liter cultures of
transformed host cells containing one of the constructs were
grown in Luria broth containing 100 mg/liter ampicillin. Each
culture was started at an optical density at 600 nm (OD600) of
0.1 and incubated in a shaking incubator at 28°C until the
culture reached an OD600 of 0.6, at which time
isopropyl-β-D-thiogalactopyranoside (IPTG) was added to each
culture to obtain a final concentration of 0.4 mM IPTG. Each
culture was incubated for an additional 3 hours at 28°C and the
cells were harvested by centrifugation at 8,000 g. The cell
pellets were then resuspended in 10 ml of 10 mM HEPES (pH 7.8),
5 mM DTT, 0.24 M sorbitol, 1 mM PMSF. The cells were lysed by
sonication with a micro-tip sonicator using four 10-second
pulses. Triton X-100 was added to each homogenate to a final
concentration of 1%. The homogenates were incubated on ice for
30 minutes, and subjected to centrifugation at 30,000 g for 30
minutes at 4°C. The supernatants of these extracts were
assayed for γ-tocopherol methyltransferase activity as follows.
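The induction step specifies only a final IPTG concentration, so the volume of stock to add depends on the stock on hand. A minimal sketch of the dilution arithmetic (C1·V1 = C2·V2) follows. The 1 M stock concentration is an assumption chosen for illustration and is not stated in the protocol, and the small increase in culture volume on addition is ignored.

```python
def stock_volume_ml(final_mm: float, culture_ml: float, stock_mm: float) -> float:
    """Volume of stock (ml) giving final_mm in culture_ml of culture.

    Uses C1*V1 = C2*V2 and neglects the volume added to the culture.
    """
    if stock_mm <= final_mm:
        raise ValueError("stock must be more concentrated than the target")
    return final_mm * culture_ml / stock_mm


# 0.4 mM IPTG in a one liter culture, from a hypothetical 1 M (1000 mM) stock.
iptg_ml = stock_volume_ml(final_mm=0.4, culture_ml=1000.0, stock_mm=1000.0)
print(f"{iptg_ml:.2f} ml of 1 M IPTG per liter of culture")
```

With a different stock concentration, only the `stock_mm` argument changes.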

The γ-TMT assays were performed in 250 µl volumes
containing 50 mM Tris (pH 7.5 for the Synechocystis and pH 8.5
for the Arabidopsis enzyme), 5 mM DTT, 5 mM γ- or δ-tocopherol,
and 0.025 µCi (55 µCi/mmole) (14C-methyl)-S-adenosylmethionine.
Reaction mixtures were incubated at room temperature for 30
minutes. The reactions were stopped by adding 1 ml of 2:1
(v:v) CHCl3:methanol containing 1 mg/ml butylated
hydroxytoluene (BHT) and 250 µl of 0.9% NaCl in water, and
vortexing. The samples were centrifuged to separate the
phases. The CHCl3 (lower) phase was transferred to a fresh
tube containing 100 mg of α-tocopherol and the CHCl3 was then
removed under vacuum in a speed-vac. The dried lipid fraction
was resuspended in 50 µl ethyl acetate containing 1 mg/ml BHT.
The lipid extracts were fractionated on silica 60 TLC plates
in dichloromethane. Tocopherols were then identified by
co-migration with authentic tocopherol standards after staining
the plate with Emmerie-Engels solution (0.1% FeCl3, 0.25%
2,2'-dipyridyl in ethanol). The band corresponding to
α-tocopherol was scraped from the TLC plate and the amount of
radioactive material present was determined by scintillation
counting. These experiments showed that the proteins encoded
by the Synechocystis SLR0089 and Arabidopsis 165H5T7 DNA
sequences were able to convert γ-tocopherol to α-tocopherol.

The Synechocystis and Arabidopsis γ-tocopherol
methyltransferases were tested for activity using several
different methyl-substituted tocopherol substrates. Both
enzymes were able to specifically convert δ-tocopherol to
β-tocopherol. The two enzymes were unable to use tocol,
5,7-dimethyltocol, β-tocopherol, and γ-tocotrienol as
substrates. These results indicate that both the Synechocystis
and Arabidopsis γ-tocopherol methyltransferases catalyze the
methylation of carbon 5 of the tocopherol chromanol ring. The
Synechocystis and Arabidopsis γ-TMTs appear to require
substrates with a methyl group present on the 8 position of the
chromanol ring and a fully saturated prenyl tail for activity.
Our results indicate that Arabidopsis γ-TMT exhibits greater
activity with γ-tocopherol as the substrate than with the
δ-tocopherol substrate, whereas the Synechocystis γ-TMT appears
to be equally active toward γ-tocopherol and δ-tocopherol.
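The substrate requirements described above can be summarized as a simple rule: position 8 methylated, position 5 free, and a saturated tail. The sketch below encodes that rule and applies it to the six compounds tested. The ring methylation patterns are the standard ones for each compound; the class and function names are illustrative, and the rule is a restatement of the reported results rather than a model of the enzyme.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Chromanol:
    name: str
    methyls: frozenset            # methylated ring positions among 5, 7, 8
    saturated_tail: bool = True   # False for tocotrienols


COMPOUNDS = [
    Chromanol("tocol", frozenset()),
    Chromanol("5,7-dimethyltocol", frozenset({5, 7})),
    Chromanol("delta-tocopherol", frozenset({8})),
    Chromanol("beta-tocopherol", frozenset({5, 8})),
    Chromanol("gamma-tocopherol", frozenset({7, 8})),
    Chromanol("gamma-tocotrienol", frozenset({7, 8}), saturated_tail=False),
]

PRODUCT_NAMES = {
    frozenset({5, 8}): "beta-tocopherol",
    frozenset({5, 7, 8}): "alpha-tocopherol",
}


def is_substrate(c: Chromanol) -> bool:
    """8-methyl present, carbon 5 unmethylated, fully saturated tail."""
    return 8 in c.methyls and 5 not in c.methyls and c.saturated_tail


def product(c: Chromanol):
    """Name of the C-5 methylated product, or None if not a substrate."""
    if not is_substrate(c):
        return None
    return PRODUCT_NAMES[c.methyls | {5}]


results = {c.name: product(c) for c in COMPOUNDS}
for name, prod in results.items():
    print(f"{name:20s} -> {prod}")
```

Only the two compounds reported as substrates pass the rule, and each maps to the product observed in the assays.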

Example 8. Qualitative manipulation of tocopherols in
           Arabidopsis and other plants by over-expressing
           the Arabidopsis γ-tocopherol methyltransferase.

The results from HPLC analysis of lipid extracts made from
Arabidopsis leaves and seeds indicate that these tissues have
relatively simple tocopherol profiles. In Arabidopsis leaves,
α-tocopherol is present at ~90% of the total tocopherol
content, with γ-tocopherol comprising the remainder of the
tocopherol content. In Arabidopsis seeds, γ-tocopherol is
present at ~95% of the total tocopherol content, with the
remaining 5% being composed of δ-tocopherol. These simple
tocopherol profiles make Arabidopsis seed and leaf tissue ideal
targets for evaluating the functional consequences of altering
the expression of a γ-tocopherol methyltransferase gene in
plants.

We hypothesized that increasing the expression of a
γ-tocopherol methyltransferase gene in Arabidopsis would
increase α-tocopherol levels as a proportion of the total
tocopherols. To test this hypothesis, the full-length
Arabidopsis γ-tocopherol methyltransferase cDNA clone 165H5T7
was over-expressed under the control of the strong constitutive
cauliflower mosaic virus 35S transcript (CaMV 35S) promoter and
the embryo-specific carrot DC3 promoter (Seffens WS et al.,
Dev. Genet. 11:65-76, 1990) in transgenic Arabidopsis.

The seed-specific plant gene expression plasmid was
constructed from a derivative of the Agrobacterium plant
transformation vector, pBIB-Hyg (Becker, D. Nucleic Acids Res.
18:203, 1990). The carrot embryo DC3 promoter was isolated
from the plasmid pBS-DC3 5'PH after digestion with HindIII and
BamHI. The DC3 HindIII-BamHI promoter fragment was then
treated with DNA polymerase to fill in the 5' over-hanging
ends. The pBIB-Hyg plasmid was digested with HindIII and then
treated with DNA polymerase to fill in the 5' over-hanging
ends. The DC3 promoter fragment was ligated to pBIB-Hyg to
create a plasmid designated p111397. The Arabidopsis
γ-tocopherol methyltransferase cDNA 165H5T7 was subcloned in
the sense orientation as a SalI-XbaI fragment into the SalI and
XbaI sites of p111397 to obtain p122997. The p122997 plasmid
has the following features: 1) plant hygromycin selectable
marker; 2) Agrobacterium T-DNA left and right border sequences;
3) the Arabidopsis 165H5T7 γ-tocopherol methyltransferase cDNA
cloned between the carrot seed specific DC3 promoter and the
nopaline synthase 3' transcriptional termination sequences; 4)
the RK2 broad host bacterial plasmid origin of replication; and
5) bacterial kanamycin resistance selectable marker.
The constitutive Arabidopsis γ-tocopherol
methyltransferase gene expression plasmid was derived from the
pSN506 CaMV 35S binary plant expression vector, a pART27
derivative in which the p-hydroxyphenylpyruvic acid
dioxygenase (HPPDase) cDNA is under the control of the CaMV 35S
promoter (Norris and DellaPenna, in press). The CaMV
35S/γ-tocopherol methyltransferase construct was made by
replacing the HPPDase cDNA with the full length 165H5T7 cDNA
sequence. The HPPDase cDNA fragment was removed from pSN506 by
digesting the plasmid with XbaI and XhoI. The 5' DNA
over-hanging ends of the pSN506 XbaI-XhoI vector fragment were
filled in using the Klenow fragment of the E. coli DNA
polymerase. The linearized vector was ligated to a blunt-ended
XbaI-SalI fragment from 165H5T7 encoding the full length
γ-tocopherol methyltransferase. A recombinant plasmid
containing the insert was obtained and designated p010398. The
plasmid p010398 has the following features: 1) plant kanamycin
selectable marker; 2) Agrobacterium T-DNA left and right border
sequences; 3) the Arabidopsis 165H5T7 γ-tocopherol
methyltransferase cDNA cloned between the CaMV 35S promoter and
the nopaline synthase 3' transcriptional termination sequences;
4) the RK2 broad host bacterial plasmid origin of replication;
and 5) bacterial kanamycin resistance selectable marker.
The seed-specific and constitutive γ-tocopherol
methyltransferase plant gene expression constructs (p122997 and
p010398, respectively) and the appropriate empty vector
controls (p111397 and pART27, respectively) were used to
transform Agrobacterium tumefaciens strain C58 GV3101. Wild
type Arabidopsis (ecotype Columbia) plants were transformed
with these Agrobacterium strains using the vacuum infiltration
method (Bechtold N, Ellis J, Pelletier G, In planta
Agrobacterium mediated gene transfer by infiltration of adult
Arabidopsis thaliana plants. CR Acad Sci Paris, 1993.
1144(2): 204-212). Seeds from the primary transformants were
selected for resistance to the appropriate antibiotic on medium
containing MS salts, 1% sucrose, 0.7% agar, and suitable levels
of the antibiotic. Antibiotic resistant seedlings
(representing the T1 generation) were transferred to soil and
grown to maturity. Leaf and seed material from these T1
generation plants were analyzed by HPLC.

Example 9. Characterization of Transgenic Plants.

A. Analysis of transgenic Arabidopsis Tocopherol Profiles

Known weights of approximately 5 mg of plant material
(i.e. seed or leaf) and 100 ng of tocol (for use as an internal
standard) were homogenized in 300 µl of 2:1 (v/v) methanol:
CHCl3 containing 1 mg/ml butylated hydroxytoluene (BHT). One
hundred µl of CHCl3 and 180 µl of 0.9% (w/v) NaCl in water were
added to the homogenate and the mixture was briefly vortexed.
The mixture was then centrifuged and the lower (CHCl3) fraction
was removed and transferred to a fresh tube. The CHCl3 fraction
was dried under vacuum and the resulting lipid residue was
resuspended in 100 µl of ethyl acetate for analysis by C18
reverse phase HPLC or in 100 µl of hexane for analysis by
normal phase HPLC.

Crude lipid extracts were analyzed by normal phase or
reverse phase HPLC for changes in tocopherol profiles.
Individual tocopherol species were quantified by comparing
their fluorescence signals with standard curves made from known
quantities of authentic tocopherol standards. Reverse phase
HPLC was done as described in Example 4. Normal phase HPLC
analysis was done on a LiChrosorb Si60A 4.6 x 250 mm HPLC column
using the following conditions:

    Column temperature: 42°C
    Mobile phase:       solvent A = HPLC grade hexane
                        solvent B = diisopropylether
    Gradient:
        time    % solvent A    % solvent B    flow rate (ml/min)
          0        92%             8%               1
         20        82%            18%               1
         25        82%            18%               1
         25        92%             8%               2
         34        92%             8%               2
    Fluorescence detector settings:
        excitation wavelength: 290 nm
        emission wavelength:   325 nm
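The gradient table lists solvent composition only at set times, so the composition in between has to be interpolated. The sketch below assumes that the time column is in minutes, that the pump ramps linearly between consecutive rows, and that the two rows at 25 represent a step change; none of this is stated in the table. Flow rate is carried along but not interpolated.

```python
# (time, percent solvent B, flow rate in ml/min) rows from the gradient table.
GRADIENT = [(0, 8, 1), (20, 18, 1), (25, 18, 1), (25, 8, 2), (34, 8, 2)]


def percent_b(t: float) -> float:
    """Percent solvent B at time t, by linear interpolation between rows."""
    if not GRADIENT[0][0] <= t <= GRADIENT[-1][0]:
        raise ValueError("time outside the programmed run")
    for (t0, b0, _), (t1, b1, _) in zip(GRADIENT, GRADIENT[1:]):
        # Skip zero-length segments, which mark a step change.
        if t1 > t0 and t0 <= t <= t1:
            return b0 + (b1 - b0) * (t - t0) / (t1 - t0)
    raise ValueError("no gradient segment covers this time")


def percent_a(t: float) -> float:
    """Percent solvent A, the remainder of the binary mobile phase."""
    return 100.0 - percent_b(t)


for t in (0, 10, 22, 30):
    print(t, percent_a(t), percent_b(t))
```

Halfway through the initial ramp the mobile phase is 87% hexane and 13% diisopropylether, and after the step at 25 it returns to the starting 92:8 composition for re-equilibration.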

The concentrations of the various tocopherol species
obtained by HPLC analysis of T1 seed material from Arabidopsis
plants transformed with p122997, p010398, p111397, and pART27
are shown in Table 1. Plants over-expressing the γ-tocopherol
methyltransferase using either the CaMV 35S or carrot DC3
promoters are able to convert the majority of the γ-tocopherol
normally present in Arabidopsis seeds to α-tocopherol and also
are able to convert the majority of the δ-tocopherol normally
present in Arabidopsis seeds to β-tocopherol. These results
show that γ-tocopherol methyltransferase activity is normally
limiting in Arabidopsis seeds.

B. Analysis of γ-tocopherol methyltransferase activity in
   transgenic Arabidopsis seed

Seeds from the T1 generation plants transformed with
p122997, p010398, p111397, and pART27 were assayed for
γ-tocopherol methyltransferase activity. Protein extracts were
made by homogenizing approximately 10 mg of seeds in 200 µl of
50 mM Tris pH 8.5, 5 mM DTT, 1% Triton X-100, 1 mM PMSF. The
extracts were centrifuged for 5 minutes to remove insoluble
material. A 25-µl aliquot of each extract supernatant was
assayed for γ-tocopherol methyltransferase activity as
described in Example 7. No γ-tocopherol methyltransferase
activity was detected in wild type seeds and empty vector
controls. Activity in seed-specific lines was approximately 2
pmol/hr/mg protein, and in 35S constitutive expression lines
activity was 0.5 pmol/hr/mg protein.

Example 11. Other Transgenic Plants.

These data demonstrate that simple insertion of a
γ-tocopherol methyltransferase gene into a plant can
dramatically change the relative proportions of tocopherols in
a plant seed. On that basis, it is reasonable to expect that
similar results can be obtained in other plant species.

It is expected that one may manipulate tocopherol profiles
in any plant species using the methods disclosed in the
examples. Based on the concentration of the various
tocopherols in untransformed plant tissue, we have predicted
tocopherol profiles obtainable for a variety of plant tissue
(Table 2). Note that several common plant oils (e.g. soybean)
which are predominantly γ-tocopherol and contain low levels of
α-tocopherol can be altered to be predominantly α-tocopherol.
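The predicted profiles follow from a simple bookkeeping rule: every γ form is counted as converted to the corresponding α form and every δ form to the corresponding β form, while existing α and β forms are unchanged. The sketch below applies that rule to the soybean and corn oil compositions given in Table 2. It assumes complete conversion, and it applies the same shift to tocotrienols because Table 2 does so; the function and variable names are illustrative.

```python
# Methylation at carbon 5 turns gamma forms into alpha and delta into beta.
CONVERSION = {"gamma": "alpha", "delta": "beta"}


def predict(profile: dict) -> dict:
    """Map {(form, family): percent} to the profile after full conversion."""
    out = {}
    for (form, family), pct in profile.items():
        key = (CONVERSION.get(form, form), family)
        out[key] = out.get(key, 0) + pct
    return out


# Untransformed compositions (percent of total) as listed in Table 2.
soybean = {
    ("gamma", "tocopherol"): 70,
    ("delta", "tocopherol"): 22,
    ("alpha", "tocopherol"): 7,
    ("beta", "tocopherol"): 1,
}
corn_oil = {
    ("alpha", "tocopherol"): 22,
    ("gamma", "tocopherol"): 68,
    ("beta", "tocopherol"): 3,
    ("delta", "tocopherol"): 7,
}

print(predict(soybean))
print(predict(corn_oil))
```

For soybean this gives 77% α-tocopherol and 23% β-tocopherol, and for corn oil 90% and 10%, matching the expected compositions in Table 2. Partial conversion, as seen in the Arabidopsis seed data, would give intermediate values.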

All publications cited in this patent application are
incorporated by reference herein.

The present invention is not limited to the exemplified
embodiment, but is intended to encompass all such modifications
and variations as come within the scope of the following
claims.

Table 2

Crop species (tissue)         Tocopherol composition      Expected tocopherol
                              of untransformed plant      composition of transgenic
                                                          plants with γ-TMT
                                                          over-expressed

Soybean [1] (seed/oil)        70% γ-tocopherol            77% α-tocopherol
                              22% δ-tocopherol            23% β-tocopherol
                              7% α-tocopherol
                              1% β-tocopherol

Oil Palm [1] (seed/oil)       25% α-tocopherol            25% α-tocopherol
                              30% α-tocotrienol           70% α-tocotrienol
                              40% γ-tocotrienol           5% β-tocotrienol
                              5% δ-tocotrienol

Peanut [2] (raw nut)          50% α-tocopherol            100% α-tocopherol
                              50% γ-tocopherol

Peanut [2] (nut oil)          33% α-tocopherol            100% α-tocopherol
                              66% γ-tocopherol

Safflower [2] (seed oil)      48% α-tocopherol            70% α-tocopherol
                              22% γ-tocopherol            30% β-tocopherol
                              30% δ-tocopherol

Rapeseed [2] (seed oil)       25% α-tocopherol            100% α-tocopherol
                              75% γ-tocopherol

Cotton Seed [1] (seed oil)    40% α-tocopherol            98% α-tocopherol
                              58% γ-tocopherol            2% β-tocopherol
                              2% δ-tocopherol

Wheat [2] (whole wheat        20% α-tocopherol            20% α-tocopherol
flour)                        7% α-tocotrienol            7% α-tocotrienol
                              17% β-tocopherol            17% β-tocopherol
                              56% β-tocotrienol           56% β-tocotrienol

Wheat [1] (germ oil)          75% α-tocopherol            100% α-tocopherol
                              25% γ-tocopherol

Corn [1] (oil)                22% α-tocopherol            90% α-tocopherol
                              68% γ-tocopherol            10% β-tocopherol
                              3% β-tocopherol
                              7% δ-tocopherol

Castor Bean [2] (oil)         50% γ-tocopherol            50% α-tocopherol
                              50% δ-tocopherol            50% β-tocopherol

Corn [2] (whole grain)        11% α-tocopherol            80% α-tocopherol
                              69% γ-tocopherol            13% α-tocotrienol
                              4% α-tocotrienol            7% β-tocotrienol
                              9% γ-tocotrienol
                              7% β-tocotrienol

Barley [2] (whole grain)      14% α-tocopherol            16% α-tocopherol
                              2% γ-tocopherol             10% β-tocopherol
                              10% β-tocopherol            51% α-tocotrienol
                              44% α-tocotrienol
                              7% γ-tocotrienol
                              23% β-tocotrienol

Rice [2] (whole grain)        50% α-tocopherol            100% α-tocopherol
                              50% γ-tocopherol

Potato [2] (tuber)            95% α-tocopherol            100% α-tocopherol
                              5% γ-tocopherol

Sunflower [2] (seeds raw)     95% α-tocopherol            100% α-tocopherol
                              γ-tocopherol (percentage
                              not legible)

Sunflower [1] (seed oil)      96% α-tocopherol            98% α-tocopherol
                              γ-tocopherol (percentage    2% β-tocopherol
                              not legible)
                              2% δ-tocopherol

Banana [1] (fruit)            100% α-tocopherol           100% α-tocopherol

Lettuce [1] (leaf)            53% α-tocopherol            100% α-tocopherol
                              47% γ-tocopherol

(crop name not legible)       72% α-tocopherol            100% α-tocopherol
                              28% γ-tocopherol

Cauliflower [2]               44% α-tocopherol            100% α-tocopherol
                              66% γ-tocopherol

Cabbage [1]                   100% α-tocopherol           100% α-tocopherol

Apple [2]                     100% α-tocopherol           100% α-tocopherol

Pears [2]                     93% α-tocopherol            100% α-tocopherol
                              7% γ-tocopherol

Carrots [2]                   94% α-tocopherol            98% α-tocopherol
                              4% γ-tocopherol             2% β-tocopherol
                              2% δ-tocopherol

[1] McLaughlin, P.J., Weihrauch, J.C. "Vitamin E content of foods", J. Am. Diet.
    Assoc. 75:647-665 (1979).

[2] Bauernfeind, J. "Tocopherols in foods", In Vitamin E: A Comprehensive
    Treatise, L.J. Machlin ed., Marcel Dekker, Inc. New York pp 99-168.

CLAIMS

We claim:

1. An isolated DNA fragment comprising a γ-tocopherol
methyltransferase coding sequence.

2. The DNA fragment of claim 1, wherein the fragment is
selected from the group consisting of SEQ ID NO:1 and SEQ ID
NO:3.

3. An isolated DNA fragment comprising Arabidopsis
γ-tocopherol methyltransferase.

4. An isolated DNA fragment comprising Synechocystis
γ-tocopherol methyltransferase.

5. A genetic construct comprising a γ-tocopherol
methyltransferase coding sequence operably connected to a plant
promoter not natively associated with the coding sequence.

6. A genetic construct as claimed in claim 5, wherein the
γ-tocopherol methyltransferase coding sequence is selected from
the group consisting of SEQ ID NO:1 and SEQ ID NO:3.

7. A transgenic plant comprising in its genome the
genetic construct of claim 5.

8. The plant of claim 5, wherein the plant has an altered
α-tocopherol:γ-tocopherol ratio relative to an untransformed
wild-type plant.

9. The seed of the plant of claim 8.

10. The plant of claim 5, wherein the plant has an
altered δ-tocopherol:β-tocopherol ratio relative to an
untransformed wild-type plant.

11. The seed of the plant of claim 10.

12. Oil from the seed of claim 11.

13. A transgenic plant of a species in which natively
α-tocopherol is not the predominant tocopherol in its seeds, the
transgenic plant altered to produce α-tocopherol as the most
abundant tocopherol in the seeds of the plant.

14. Seeds of the plant of claim 13.

15. Oil from the seeds of claim 14.

16. A transgenic plant as claimed in claim 13 wherein the
transgenic plant carries in its genome a foreign genetic
construction comprising a γ-tocopherol methyltransferase gene
selected from the group consisting of SEQ ID NO:1 and SEQ ID
NO:3.

17. A transgenic plant which has an altered profile of
tocopherols in its seeds or oils compared to non-transgenic
plants of the same species.

18. Seed of the plant of claim 17.

19. Oil from the seeds of claim 18.

20. A transgenic plant seed of a plant species in which
α-tocopherol is natively not the predominant tocopherol in
seeds, the transgenic plant seed containing α-tocopherol as the
most abundant tocopherol present in the transgenic plant seed.

21. Oil from the seed of claim 20.

22. A transgenic plant having an altered relative
proportion of tocopherols in its tissues as compared to
non-transgenic plants of the same species, the transgenic plant
comprising in its genome an inserted γ-tocopherol
methyltransferase coding sequence.

23. The plant of claim 22 wherein the γ-tocopherol
methyltransferase is in the sense orientation.

24. The plant of claim 22 wherein the γ-tocopherol
methyltransferase is in its antisense orientation.

25. A method of producing α-tocopherol comprising the
steps of:

(a) providing an expression host cell comprising in its
genome a γ-tocopherol methyltransferase coding sequence
operably connected to a promoter not natively associated with
the sequence, wherein the promoter is functional in the host
cell;

(b) culturing the host cell under conditions suitable to
allow expression of the γ-tocopherol methyltransferase; and

(c) reacting γ-tocopherol and S-adenosylmethionine with
the γ-tocopherol methyltransferase protein of step b under
suitable conditions and for a period of time sufficient to
allow conversion of γ-tocopherol to α-tocopherol.

SEQUENCE LISTING



(1) GENERAL INFORMATION:

     (i) APPLICANT: DellaPenna, Dean
                    Shintani, David K.

    (ii) TITLE OF INVENTION: TRANSGENIC PLANTS WITH TOCOPHEROL
                             METHYLTRANSFERASE

   (iii) NUMBER OF SEQUENCES: 10

    (iv) CORRESPONDENCE ADDRESS:
          (A) ADDRESSEE: Quarles & Brady
          (B) STREET: 1 South Pinckney Street
          (C) CITY: Madison
          (D) STATE: WI
          (E) COUNTRY: US
          (F) ZIP: 53701-2113

     (v) COMPUTER READABLE FORM:
          (A) MEDIUM TYPE: Floppy disk
          (B) COMPUTER: IBM PC compatible
          (C) OPERATING SYSTEM: PC-DOS/MS-DOS
          (D) SOFTWARE: PatentIn Release #1.0, Version #1.30

    (vi) CURRENT APPLICATION DATA:
          (A) APPLICATION NUMBER: US
          (B) FILING DATE:
          (C) CLASSIFICATION:

  (viii) ATTORNEY/AGENT INFORMATION:
          (A) NAME: Seay, Nicholas J.
          (B) REGISTRATION NUMBER: 27386
          (C) REFERENCE/DOCKET NUMBER: 920905.90024

    (ix) TELECOMMUNICATION INFORMATION:
          (A) TELEPHONE: 608-251-5000
          (B) TELEFAX: 608-251-9166

(2) INFORMATION FOR SEQ ID NO:1:

     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 954 base pairs
          (B) TYPE: nucleic acid
          (C) STRANDEDNESS: double
          (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: DNA (genomic)

    (ix) FEATURE:
          (A) NAME/KEY: CDS
          (B) LOCATION: 1..954

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

ATG GTT TAC CAT GTT AGG CCT AAG CAC GCC CTG TTC TTA GCA TTC TAT   48
Met Val Tyr His Val Arg Pro Lys His Ala Leu Phe Leu Ala Phe Tyr
1               5                   10                  15

TGT TAT TTC TCT TTG CTT ACC ATG GCC AGC GCC ACC ATT GCC AGT GCA   96
Cys Tyr Phe Ser Leu Leu Thr Met Ala Ser Ala Thr Ile Ala Ser Ala
            20                  25                  30

GAC CTC TAC GAA AAA ATT AAA AAT TTC TAC GAC GAC TCC AGC GGT CTC  144
Asp Leu Tyr Glu Lys Ile Lys Asn Phe Tyr Asp Asp Ser Ser Gly Leu
        35                  40                  45

TGG GAA GAC GTT TGG GGT GAG CAT ATG CAC CAC GGC TAC TAC GGT CCC  192
Trp Glu Asp Val Trp Gly Glu His Met His His Gly Tyr Tyr Gly Pro
    50                  55                  60

CAC GGC ACC TAT CGG ATC GAT CGC CGC CAG GCT CAA ATT GAT CTG ATC  240
His Gly Thr Tyr Arg Ile Asp Arg Arg Gln Ala Gln Ile Asp Leu Ile
65                  70                  75                  80

AAA GAA CTA TTG GCC TGG GCA GTG CCC CAA AAT AGC GCC AAA CCA CGA  288
Lys Glu Leu Leu Ala Trp Ala Val Pro Gln Asn Ser Ala Lys Pro Arg
                85                  90                  95

AAA ATT CTC GAT TTA GGC TGT GGC ATT GGC GGC AGT AGT TTG TAC TTG  336
Lys Ile Leu Asp Leu Gly Cys Gly Ile Gly Gly Ser Ser Leu Tyr Leu
            100                 105                 110

GCC CAG CAA CAC CAA GCA GAA GTG ATG GGG GCT AGT CTT TCC CCA GTG  384
Ala Gln Gln His Gln Ala Glu Val Met Gly Ala Ser Leu Ser Pro Val
        115                 120                 125

CAG GTG GAA CGG GCG GGG GAA AGG GCC AGG GCC CTG GGG TTG GGC TCA  432
Gln Val Glu Arg Ala Gly Glu Arg Ala Arg Ala Leu Gly Leu Gly Ser
    130                 135                 140

ACC TGC CAG TTT CAG GTG GCC AAT GCC TTG GAT TTG CCC TTT GCT TCC  480
Thr Cys Gln Phe Gln Val Ala Asn Ala Leu Asp Leu Pro Phe Ala Ser
145                 150                 155                 160

GAT TCC TTT GAC TGG GTT TGG TCG TTG GAA AGT GGG GAG CAC ATG CCC  528
Asp Ser Phe Asp Trp Val Trp Ser Leu Glu Ser Gly Glu His Met Pro
                165                 170                 175

AAC AAA GCT CAG TTT TTA CAA GAA GCT TGG CGG GTA CTT AAA CCA GGT  576
Asn Lys Ala Gln Phe Leu Gln Glu Ala Trp Arg Val Leu Lys Pro Gly
            180                 185                 190

GGC CGT CTG ATT TTA GCG ACC TGG TGT CAT CGT CCC ATT GAT CCC GGC  624
Gly Arg Leu Ile Leu Ala Thr Trp Cys His Arg Pro Ile Asp Pro Gly
        195                 200                 205

AAT GGC CCC CTG ACT GCC GAT GAA CGT CGC CAT CTC CAA GCC ATC TAT  672
Asn Gly Pro Leu Thr Ala Asp Glu Arg Arg His Leu Gln Ala Ile Tyr
    210                 215                 220

GAC GTT TAC TGT TTG CCC TAT GTG GTT TCC CTG CCG GAC TAC GAG GCG  720
Asp Val Tyr Cys Leu Pro Tyr Val Val Ser Leu Pro Asp Tyr Glu Ala
225                 230                 235                 240

ATC GCC AGG GAA TGT GGG TTT GGG GAA ATT AAG ACT GCC GAT TGG TCA  768
Ile Ala Arg Glu Cys Gly Phe Gly Glu Ile Lys Thr Ala Asp Trp Ser
                245                 250                 255

GTG GCG GTG GCA CCT TTT TGG GAC CGG GTG ATT GAG TCT GCG TTC GAT  816
Val Ala Val Ala Pro Phe Trp Asp Arg Val Ile Glu Ser Ala Phe Asp
            260                 265                 270

CCC CGG GTG TTG TGG GCC TTG GGG CAA GCG GGG CCA AAA ATT ATC AAT  864
Pro Arg Val Leu Trp Ala Leu Gly Gln Ala Gly Pro Lys Ile Ile Asn
        275                 280                 285

GCC GCC CTG TGT TTA CGA TTA ATG AAA TGG GGC TAT GAA CGG GGA TTA  912
Ala Ala Leu Cys Leu Arg Leu Met Lys Trp Gly Tyr Glu Arg Gly Leu
    290                 295                 300

GTG CGT TTT GGC TTA TTA ACG GGG ATA AAG CCT TTA GTT TGA          954
Val Arg Phe Gly Leu Leu Thr Gly Ile Lys Pro Leu Val *
305                 310                 315



(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 318 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Met Val Tyr His Val Arg Pro Lys His Ala Leu Phe Leu Ala Phe Tyr 
1 5 10 15

Cys Tyr Phe Ser Leu Leu Thr Met Ala Ser Ala Thr Ile Ala Ser Ala
20 25 30

Asp Leu Tyr Glu Lys Ile Lys Asn Phe Tyr Asp Asp Ser Ser Gly Leu
35 40 45

Trp Glu Asp Val Trp Gly Glu His Met His His Gly Tyr Tyr Gly Pro
50 55 60

His Gly Thr Tyr Arg Ile Asp Arg Arg Gln Ala Gln Ile Asp Leu Ile
65 70 75 80

Lys Glu Leu Leu Ala Trp Ala Val Pro Gln Asn Ser Ala Lys Pro Arg
85 90 95

Lys Ile Leu Asp Leu Gly Cys Gly Ile Gly Gly Ser Ser Leu Tyr Leu
100 105 110

Ala Gln Gln His Gln Ala Glu Val Met Gly Ala Ser Leu Ser Pro Val
115 120 125

Gln Val Glu Arg Ala Gly Glu Arg Ala Arg Ala Leu Gly Leu Gly Ser
130 135 140

Thr Cys Gln Phe Gln Val Ala Asn Ala Leu Asp Leu Pro Phe Ala Ser
145 150 155 160

Asp Ser Phe Asp Trp Val Trp Ser Leu Glu Ser Gly Glu His Met Pro
165 170 175

Asn Lys Ala Gln Phe Leu Gln Glu Ala Trp Arg Val Leu Lys Pro Gly
180 185 190

Gly Arg Leu Ile Leu Ala Thr Trp Cys His Arg Pro Ile Asp Pro Gly
195 200 205

Asn Gly Pro Leu Thr Ala Asp Glu Arg Arg His Leu Gln Ala Ile Tyr
210 215 220

Asp Val Tyr Cys Leu Pro Tyr Val Val Ser Leu Pro Asp Tyr Glu Ala
225 230 235 240

Ile Ala Arg Glu Cys Gly Phe Gly Glu Ile Lys Thr Ala Asp Trp Ser
245 250 255

Val Ala Val Ala Pro Phe Trp Asp Arg Val Ile Glu Ser Ala Phe Asp
260 265 270

Pro Arg Val Leu Trp Ala Leu Gly Gln Ala Gly Pro Lys Ile Ile Asn
275 280 285

Ala Ala Leu Cys Leu Arg Leu Met Lys Trp Gly Tyr Glu Arg Gly Leu
290 295 300

Val Arg Phe Gly Leu Leu Thr Gly Ile Lys Pro Leu Val *
305 310 315 
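
The correspondence between the nucleotide sequence of SEQ ID NO: 1 and the amino acid sequence of SEQ ID NO: 2 can be spot-checked by translating a stretch of the coding sequence. The following Python sketch is illustrative only and is not part of the original listing: it assumes the standard genetic code, and the names `CODON_TABLE`, `THREE_LETTER`, `translate`, and `FRAGMENT_49_96` are hypothetical. It translates nucleotides 49 to 96 of SEQ ID NO: 1, which should correspond to residues 17 to 32 of SEQ ID NO: 2.

```python
# Illustrative sketch; all names are hypothetical and not part of the listing.
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... (assumed).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# One-letter to three-letter residue codes, as used in the listing.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "*",
}


def translate(dna):
    """Translate a DNA string (whitespace ignored) into three-letter codes."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return [THREE_LETTER[CODON_TABLE[dna[i:i + 3]]] for i in range(0, usable, 3)]


# Nucleotides 49-96 of SEQ ID NO: 1, copied from the listing above.
FRAGMENT_49_96 = "TGT TAT TTC TCT TTG CTT ACC ATG GCC AGC GCC ACC ATT GCC AGT GCA"
print(" ".join(translate(FRAGMENT_49_96)))
```

The same function can be applied to any other line of the listing to confirm that an isoleucine codon (ATT, ATC, or ATA) is rendered as Ile and a glutamine codon (CAA or CAG) as Gln.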

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1790 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 207..1253

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

GCTCGCATGT TGTGTGGAAT TGTGAGCGGA TAACAATTTC ACACAGGAAA CAGCTATGAC 60

CATGATTACG CCAAGCTCTA ATACGACTCA CTATAGGGAA AGCTGGTACG CCTGCAGGTA 120

CCGGTCCGGA ATTCCCGGGT CGACCCACGC GTCCGCAAAT AATCCCTGAC TTCGTCACGT 180

TTCTTTGTAT CTCCAACGTC CAATAA ATG AAA GCA ACT CTA GCA GCA CCC TCT 233
Met Lys Ala Thr Leu Ala Ala Pro Ser
320 325

TCT CTC ACA AGC CTC CCT TAT CGA ACC AAC TCT TCT TTC GGC TCA AAG 281
Ser Leu Thr Ser Leu Pro Tyr Arg Thr Asn Ser Ser Phe Gly Ser Lys
330 335 340

TCA TCG CTT CTC TTT CGG TCT CCA TCC TCC TCC TCC TCA GTC TCT ATG 329
Ser Ser Leu Leu Phe Arg Ser Pro Ser Ser Ser Ser Ser Val Ser Met
345 350 355

ACG ACA ACG CGT GGA AAC GTG GCT GTG GCG GCT GCT GCT ACA TCC ACT 377
Thr Thr Thr Arg Gly Asn Val Ala Val Ala Ala Ala Ala Thr Ser Thr
360 365 370 375

GAG GCG CTA AGA AAA GGA ATA GCG GAG TTC TAC AAT GAA ACT TCG GGT 425
Glu Ala Leu Arg Lys Gly Ile Ala Glu Phe Tyr Asn Glu Thr Ser Gly
380 385 390

TTG TGG GAA GAG ATT TGG GGA GAT CAT ATG CAT CAT GGC TTT TAT GAC 473
Leu Trp Glu Glu Ile Trp Gly Asp His Met His His Gly Phe Tyr Asp
395 400 405

CCT GAT TCT TCT GTT CAA CTT TCT GAT TCT GGT CAC AAG GAA GCT CAG 521
Pro Asp Ser Ser Val Gln Leu Ser Asp Ser Gly His Lys Glu Ala Gln
410 415 420

ATC CGT ATG ATT GAA GAG TCT CTC CGT TTC GCC GGT GTT ACT GAT GAA 569
Ile Arg Met Ile Glu Glu Ser Leu Arg Phe Ala Gly Val Thr Asp Glu
425 430 435

GAG GAG GAG AAA AAG ATA AAG AAA GTA GTG GAT GTT GGG TGT GGG ATT 617
Glu Glu Glu Lys Lys Ile Lys Lys Val Val Asp Val Gly Cys Gly Ile
440 445 450 455

GGA GGA AGC TCA AGA TAT CTT GCC TCT AAA TTT GGA GCT GAA TGC ATT 665
Gly Gly Ser Ser Arg Tyr Leu Ala Ser Lys Phe Gly Ala Glu Cys Ile
460 465 470

GGC ATT ACT CTC AGC CCT GTT CAG GCC AAG AGA GCC AAT GAT CTC GCG 713
Gly Ile Thr Leu Ser Pro Val Gln Ala Lys Arg Ala Asn Asp Leu Ala
475 480 485

GCT GCT CAA TCA CTC TCT CAT AAG GCT TCC TTC CAA GTT GCG GAT GCG 761
Ala Ala Gln Ser Leu Ser His Lys Ala Ser Phe Gln Val Ala Asp Ala
490 495 500

TTG GAT CAG CCA TTC GAA GAT GGA AAA TTC GAT CTA GTG TGG TCG ATG 809
Leu Asp Gln Pro Phe Glu Asp Gly Lys Phe Asp Leu Val Trp Ser Met
505 510 515

GAG AGT GGT GAG CAT ATG CCT GAC AAG GCC AAG TTT GTA AAA GAG TTG 857
Glu Ser Gly Glu His Met Pro Asp Lys Ala Lys Phe Val Lys Glu Leu
520 525 530 535

GTA CGT GTG GCG GCT CCA GGA GGT AGG ATA ATA ATA GTG ACA TGG TGC 905
Val Arg Val Ala Ala Pro Gly Gly Arg Ile Ile Ile Val Thr Trp Cys
540 545 550

CAT AGA AAT CTA TCT GCG GGG GAG GAA GCT TTG CAG CCG TGG GAG CAA 953
His Arg Asn Leu Ser Ala Gly Glu Glu Ala Leu Gln Pro Trp Glu Gln
555 560 565

AAC ATC TTG GAC AAA ATC TGT AAG ACG TTC TAT CTC CCG GCT TGG TGC 1001
Asn Ile Leu Asp Lys Ile Cys Lys Thr Phe Tyr Leu Pro Ala Trp Cys
570 575 580

TCC ACC GAT GAT TAT GTC AAC TTG CTT CAA TCC CAT TCT CTC CAG GAT 1049
Ser Thr Asp Asp Tyr Val Asn Leu Leu Gln Ser His Ser Leu Gln Asp
585 590 595

ATT AAG TGT GCG GAT TGG TCA GAG AAC GTA GCT CCT TTC TGG CCT GCG 1097
Ile Lys Cys Ala Asp Trp Ser Glu Asn Val Ala Pro Phe Trp Pro Ala
600 605 610 615

GTT ATA CGG ACT GCA TTA ACA TGG AAG GGC CTT GTG TCT CTG CTT CGT 1145
Val Ile Arg Thr Ala Leu Thr Trp Lys Gly Leu Val Ser Leu Leu Arg
620 625 630

AGT GGT ATG AAA AGT ATT AAA GGA GCA TTG ACA ATG CCA TTG ATG ATT 1193
Ser Gly Met Lys Ser Ile Lys Gly Ala Leu Thr Met Pro Leu Met Ile
635 640 645

GAA GGT TAC AAG AAA GGT GTC ATT AAG TTT GGT ATC ATC ACT TGC CAG 1241
Glu Gly Tyr Lys Lys Gly Val Ile Lys Phe Gly Ile Ile Thr Cys Gln
650 655 660

AAG CCA CTC TAA GTCTAAAGCT ATACTAGGAG ATTCAATAAG ACTATAAGAG 1293
Lys Pro Leu *
665

TAGTGTCTCA TGTGAAAGCA TGAAATTCCT TAAAAACGTC AATGTTAAGC CTATGCTTCG 1353

TTATTTGTTT TAGATAAGTA TCATTTCACT CTTGTCTAAG GTAGTTTCTA TAAACAATAA 1413

ATACCATGAA TTAGCTCATG TTATCTGGTA AATTCTCGGA AGTGATTGTC ATGGATTAAC 1473

TCAAAAAAAA AAAAAAAAAA AGGGCGGCCG CTCTAGAGGA TCCAAGCTTA CGTACGCGTG 1533

CATGCGACGT CATAAGTCTA TCATACCGTC GACCTCGAGG GGGGCCCTAA ATTCAATTCA 1593

CTGGCCGTCG TTTTACAACG TCGTGACTGG GAAAACCCTG GCGTTACCCA ACTTAATCGC 1653

CTTGCAGCAC ATCCCCCTTT CGCCAGCTGG CGTAATAGCG AAGAGGCCCG CACCGATCGC 1713

CCTTCCCAAC AGTTGCGCAG CCTGAATGGC GAATGGGACG CGCCCTGTAG CGGCGCATTA 1773

AGCGCGGCGG GTGTGGT 1790

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 349 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Met Lys Ala Thr Leu Ala Ala Pro Ser Ser Leu Thr Ser Leu Pro Tyr 
1 5 10 15

Arg Thr Asn Ser Ser Phe Gly Ser Lys Ser Ser Leu Leu Phe Arg Ser
20 25 30

Pro Ser Ser Ser Ser Ser Val Ser Met Thr Thr Thr Arg Gly Asn Val
35 40 45

Ala Val Ala Ala Ala Ala Thr Ser Thr Glu Ala Leu Arg Lys Gly Ile
50 55 60

Ala Glu Phe Tyr Asn Glu Thr Ser Gly Leu Trp Glu Glu Ile Trp Gly
65 70 75 80

Asp His Met His His Gly Phe Tyr Asp Pro Asp Ser Ser Val Gln Leu
85 90 95

Ser Asp Ser Gly His Lys Glu Ala Gln Ile Arg Met Ile Glu Glu Ser
100 105 110

Leu Arg Phe Ala Gly Val Thr Asp Glu Glu Glu Glu Lys Lys Ile Lys
115 120 125

Lys Val Val Asp Val Gly Cys Gly Ile Gly Gly Ser Ser Arg Tyr Leu
130 135 140

Ala Ser Lys Phe Gly Ala Glu Cys Ile Gly Ile Thr Leu Ser Pro Val
145 150 155 160

Gln Ala Lys Arg Ala Asn Asp Leu Ala Ala Ala Gln Ser Leu Ser His
165 170 175

Lys Ala Ser Phe Gln Val Ala Asp Ala Leu Asp Gln Pro Phe Glu Asp
180 185 190

Gly Lys Phe Asp Leu Val Trp Ser Met Glu Ser Gly Glu His Met Pro
195 200 205

Asp Lys Ala Lys Phe Val Lys Glu Leu Val Arg Val Ala Ala Pro Gly
210 215 220

Gly Arg Ile Ile Ile Val Thr Trp Cys His Arg Asn Leu Ser Ala Gly
225 230 235 240

Glu Glu Ala Leu Gln Pro Trp Glu Gln Asn Ile Leu Asp Lys Ile Cys
245 250 255

Lys Thr Phe Tyr Leu Pro Ala Trp Cys Ser Thr Asp Asp Tyr Val Asn
260 265 270

Leu Leu Gln Ser His Ser Leu Gln Asp Ile Lys Cys Ala Asp Trp Ser
275 280 285

Glu Asn Val Ala Pro Phe Trp Pro Ala Val Ile Arg Thr Ala Leu Thr
290 295 300

Trp Lys Gly Leu Val Ser Leu Leu Arg Ser Gly Met Lys Ser Ile Lys
305 310 315 320

Gly Ala Leu Thr Met Pro Leu Met Ile Glu Gly Tyr Lys Lys Gly Val
325 330 335

Ile Lys Phe Gly Ile Ile Thr Cys Gln Lys Pro Leu *
340 345 
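
The stated protein lengths can be reconciled with the coding-region coordinates by simple arithmetic. The Python sketch below is illustrative only and is not part of the original listing; the function name `cds_codon_count` is hypothetical. It takes the CDS location 207..1253 given for SEQ ID NO: 3 and assumes that the coding sequence of SEQ ID NO: 1 spans nucleotides 1 to 954, as the running nucleotide counts in that listing suggest.

```python
# Illustrative sketch; the function name is hypothetical.
def cds_codon_count(start, end):
    """Return the number of codons in a CDS given inclusive 1-based coordinates."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return length // 3


# SEQ ID NO: 3, CDS location 207..1253 as stated in the feature table.
print(cds_codon_count(207, 1253))  # 349
# SEQ ID NO: 1, assuming the coding sequence runs from nucleotide 1 to 954.
print(cds_codon_count(1, 954))     # 318
```

Both codon counts include the terminal stop codon, which the listing prints as an asterisk. The encoded polypeptides would therefore have 348 residues (SEQ ID NO: 4) and 317 residues (SEQ ID NO: 2), consistent with the residue numbering shown above.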

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 35 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "oligonucleotide"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

ACGGATCCAA AAATGCCTAT GGTTCATCAT CGGGG 35

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 35 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "Oligonucleotide"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

GGGGATCCTG TGGACTTCAA ACTAAAGGCT TTATC 35

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 24 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "Oligonucleotide"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

CCTCATGATT TACCATGTTA GGCC 24

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 24 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "Oligonucleotide"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

AGATCTCAAA CTAAAGGCTT TATC 24 
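
The 3' portions of the oligonucleotides of SEQ ID NO: 6 and SEQ ID NO: 8 appear to be complementary to the last 19 nucleotides of the coding sequence of SEQ ID NO: 1, which would be expected of reverse primers for that sequence. The Python sketch below checks this by reverse complementation. It is illustrative only and is not part of the original listing: the names `reverse_complement` and `anneals_at_3prime` are hypothetical, the 19-nucleotide window is an assumption chosen for this check, and the primer role is inferred from the sequences rather than stated in this listing.

```python
# Illustrative sketch; all names and the 19-nt window are assumptions.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def clean(seq):
    """Remove whitespace from a sequence string and uppercase it."""
    return "".join(seq.split()).upper()


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return clean(seq).translate(COMPLEMENT)[::-1]


def anneals_at_3prime(primer, coding_end, n=19):
    """True if the last n nt of the primer are complementary to the
    last n nt of the coding strand."""
    return reverse_complement(clean(primer)[-n:]) == clean(coding_end)[-n:]


SEQ_ID_6 = "GGGGATCCTG TGGACTTCAA ACTAAAGGCT TTATC"
SEQ_ID_8 = "AGATCTCAAA CTAAAGGCTT TATC"
# Nucleotides 936-954 of SEQ ID NO: 1 (coding strand), copied from the listing.
SEQ_ID_1_END = "G ATA AAG CCT TTA GTT TGA"

print(anneals_at_3prime(SEQ_ID_6, SEQ_ID_1_END))
print(anneals_at_3prime(SEQ_ID_8, SEQ_ID_1_END))
```

The remaining 5' nucleotides of each oligonucleotide do not match SEQ ID NO: 1 and are not examined by this check.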

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 24 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "Oligonucleotide"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

CCATGCTGTG GCGGCTGCTG CTAC 24

(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 24 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "Oligonucleotide"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

GTCGACGCAT GCACGCGTAC GTAA 24
